Weak-To-Strong Generalization
lesswrong.com·7h
Category Theory
Flag this post
Model welfare and open source
lesswrong.com·7h
Incremental Computation
Flag this post
Get Ready for Clojure, GPU, and AI in 2026 with CUDA 13.0
dragan.rocks·2d·
Discuss: Hacker News
🦀Rust
Flag this post
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.com·1d
🔍AI Interpretability
Flag this post
Freewriting in my head, and overcoming the “twinge of starting”
lesswrong.com·1d
🗃️Zettelkasten
Flag this post
An intro to the Tensor Economics blog
lesswrong.com·3d
🦀Rust
Flag this post
2025 Unofficial LW Community Census, Request for Comments
lesswrong.com·4h
🌿Digital Gardens
Flag this post
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.com·1h
🌿Digital Gardens
Flag this post
Agentic AI and Security
martinfowler.com·4d·
🚀MLOps
Flag this post
Evidence on language model consciousness
lesswrong.com·1d
🔍AI Interpretability
Flag this post
Economics and Transformative AI (by Tom Cunningham)
lesswrong.com·11h
🔍AI Interpretability
Flag this post
Decision theory when you can't make decisions
lesswrong.com·11h
🎯Reinforcement Learning
Flag this post
An Opinionated Guide to Privacy Despite Authoritarianism
lesswrong.com·3d·
Discuss: r/privacy
🏠Self-Hosting
Flag this post
Model Parameters as a Steganographic Private Channel
lesswrong.com·5d
λFunctional Programming
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·2d
🎯Reinforcement Learning
Flag this post
FTL travel and scientific realism
lesswrong.com·3h
👁️Observability
Flag this post
Seattle Secular Solstice 2025 – Dec 20th
lesswrong.com·17h
🌿Digital Gardens
Flag this post
You’re always stressed, your mind is always busy, you never have enough time
lesswrong.com·11h
Writing
Flag this post